DWT-based classification of acoustic-phonetic classes and phonetic units
Abstract
In this paper, we describe a new algorithm based on the discrete wavelet transform (DWT) that uses a multithreshold decision model (MTD model) to detect acoustic and phonetic classes from 10 ms speech signal segments. The best thresholds of the model are found by experimental pattern classification. A unit-level interpolation technique is then combined with the MTD model to classify phonetic units from sequences of 10 ms segments. The results of the classifiers are compared and jointly adjusted by an interactive scheme (IS) to improve the performance of the algorithm. The algorithm is tested on the TIMIT database and compared with the SUB-CRA-based algorithm and other algorithms to demonstrate its effectiveness.
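The abstract does not give the wavelet, the sub-band features, or the threshold values, so the following is only a minimal sketch of the general idea of a DWT front end followed by threshold decisions on a 10 ms segment. Everything specific here is an assumption made for illustration: the Haar wavelet, the three decomposition levels, the class labels ("silence", "voiced", "unvoiced"), the function names, and the two threshold values. It is not the paper's MTD model.

```python
import math

def haar_dwt(signal, levels):
    """Multi-level Haar DWT (assumed wavelet, not taken from the paper).

    Returns [detail_1, ..., detail_L, approx_L], finest detail band first.
    """
    approx = list(signal)
    bands = []
    s = math.sqrt(2.0)
    for _ in range(levels):
        if len(approx) < 2:
            break
        if len(approx) % 2:
            # Pad odd-length input by repeating the last sample.
            approx.append(approx[-1])
        pairs = range(0, len(approx), 2)
        detail = [(approx[i] - approx[i + 1]) / s for i in pairs]
        approx = [(approx[i] + approx[i + 1]) / s for i in pairs]
        bands.append(detail)
    bands.append(approx)
    return bands

def band_energies(bands):
    """Energy (sum of squared coefficients) of each sub-band."""
    return [sum(c * c for c in band) for band in bands]

def classify_segment(segment, silence_thr=1e-3, high_ratio_thr=0.5, levels=3):
    """Toy two-threshold decision on one segment.

    Both thresholds and all three class labels are hypothetical placeholders.
    """
    energies = band_energies(haar_dwt(segment, levels))
    total = sum(energies)
    # Threshold 1: mean energy per sample separates silence from speech.
    if total / len(segment) < silence_thr:
        return "silence"
    # Threshold 2: share of energy in the finest (highest-frequency) band.
    high_ratio = energies[0] / total
    return "unvoiced" if high_ratio > high_ratio_thr else "voiced"

# A 10 ms segment at an assumed 16 kHz sampling rate has 160 samples.
tone = [0.5 * math.sin(2 * math.pi * 200 * n / 16000) for n in range(160)]
print(classify_segment(tone))
```

In a real system the thresholds would be tuned on labeled data, as the abstract indicates, rather than fixed by hand as they are here.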
Similar resources
An approach to obtain weighted graphs of words based on phoneme detection
In this paper, we present an approach for phoneme detection and phonetic classification that can be used as a basis for different speech processes, such as phoneme boundary detection, acoustic-phonetic decoding or word-graph construction with acoustic confidence scores. The phonetic classifier that has been developed is based on a phase of acoustic vector clustering in the space of acoustic cha...
Significance of group delay based acoustic features in the linguistic search space for robust speech recognition
In this paper we discuss the complementarity of the group delay features with respect to other conventional acoustic features and also propose the use of such diverse information in the linguistic search space for robust speech recognition. A discriminability analysis is carried out on various classes of phonetic units. A class based phonetic unit analysis is conducted to compare the suitabilit...
Knowledge based approach to consonant recognition
This paper presents a knowledge based approach to consonant recognition. In traditional knowledge based systems, the expert is the linguist/phonetician who attempts to describe and quantify the acoustic events, in the form of production rules into phonetic description. This paper proposes to alter the expert's role so that the expert only needs to provide the basic structure of the phonetic cla...
Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation
A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. In order to obtain better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly used in speech recognition. However, this representation generally increases the number of recognition ...
Syllable structure based phonetic units for context-dependent continuous Thai speech recognition
The choice of phonetic units for a speech recognizer is a factor that greatly affects system performance. Phonetic units are normally defined according to the acoustic properties of speech. Nevertheless, with limited training data, overly fine acoustic properties are ignored. Syllable structure is one of the properties usually ignored in English phonetic units due to a lot of possible onsets...